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(57) L 'invention conceme des banques combinatoires de 
composes biochimiques marques ainsi que leurs 
precedes de production. Ces precedes consistent a 
produire des unites individuelles marquees, a combiner 
au moins deux de ces des unites individuelles marquees 
de maniere a produire un compose biochimique marque, 
et a repeter ce processus au moins une fois de maniere a 
produire une banque combinatoire de composes 
biochimiques marques. L 'invention cencerae egalement 
des precedes permettant d'etablir la conformation d'un 
compose biochimique, lesdits precedes consistant a 
produire une banque combinatoire de composes 
biochimiques marques, a mettre cette banque 
combinatoire de composes biochimiques marques en 
contact avec une molecule receptrice cible afin qu'un 
compose biochimique marque selectionne se lie a la 
molecule receptrice cible, et enfin a etablir la 
conformation du compose biochimique marque 
selectionne apres qu'il se soit lie a la molecule receptrice. 



(57) Combinatorial libraries of labeled biochemical 
compounds and methods for producing such 
combinatorial libraries comprising the steps of 
producing labeled individual units, combining at least 
two of the labeled individual units so as to produce a 
labeled biochemical compound, and repeating this 
process at least once so as to produce a combinatorial 
library' of labeled biochemical compounds. Also, 
methods for detemining the conformation of a 
biochemical compound which comprise producing a 
combinatorial library of labeled biochemical 
compounds, contacting the combinatorial library of 
labeled biochemial compounds with a target receptor 
molecule so that a selected labeled biochemical 
compound binds to the target receptor molecule, and 
determining the conformation of the selected labeled 
biochemical compound when bound to the receptor 
molecule. 
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COMBINATORIAL LIBRARIES OF LABELED BIOCHEMICAL 
COMPOIMDS AHD METHODS FOR PRODUCING SAME 

Field of the Invention 

The present invention relates to labeled 
5 coiabinatorial libraries and methods for their 

production. These labeled combinatorial libraries can, 
among other functions, aid in the determination of the 
3 -dimensional structures of candidate drugs bound to 
their target receptor molecules, regardless of whether 
10 ox not the structure of the target receptor molecule is 

known. 

Backq-round of the Invention 

Since Erhiich ( Ber. Dtsch. Chem. Ges . . Vol. 42, 
p. 17 (1909)) first postulated that a drug functions by 

15 binding to a particular molecular target in the body- 
called a "receptor," there has been intense interest in 
the use of receptor binding to elucidate structure- 
function relationships in drug research- Historically, 
natural sources of materials such as fungi and plants 

2 0 have been screened for active compounds, and indeed 
these sources provided such vital drugs as the 
anti-bacterial agent penicillin and the anti-cancer 
agent vincristine. As the structures of biologically 
active compounds have been determined, organic chemists 

2 5 would modify the naturally occurring active compounds 
in a search for f lirther active compounds . The 
interplay between such naturally occurring compounds 
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and their chemical derivatives form the foundations of 
research in the modern pharmaceutical industry . 

More recently, two new techniques for designing 
biologically active compounds have been developed. The 
5 first technique. Structure Based Drug Design, seeks to 
design a drug by first determining the 3 -dimensional 
("3D") structure of the target receptor molecule and 
then designing the drug "to fit" the target receptor 
molecule . The structure of the target receptor 

10 molecule can be determined either by crystallizing the 
target protein and examining the crystals using X-ray 
crystallography (see, e.g., Agouron' s work with HIV-l 
protease disclosed at the Chemical & Biomolecular 
Diversity Conference held in San Diego, CA, December 

15 14-16, 1994 (hereinafter referred to as the "CBD 

Conference")) or by nuclear magnetic resonance ("NMR") 
(see, U.S. Patent No. 5,324,658 and references cited 
therein) . From the structure of the whole molecule, 
the researcher can identify that part of the molecule - 

20 the "binding site" - to which the active molecule drug 
binds . A putative drug is then designed using 
computational techniques to fit the exact space and 
shape of the binding " sit e . Using this technique, a 
number of promising drug candidates have been designed, 

25 including inhibitors of thymidylate synthase (Appelt, 
K., et al., J. Med. Chem. , 34:1925 (1991)) and several 
inhibitors of HIV-l protease (see, disclosures at the 
CBD Conference) . 

The second technique, the "combinatorial library" 

3 0 approach, employs virtually the opposite approach. 
This technique involves the synthesis of as many 
different compounds as possible and the seieccion of 
candidate compounds by screening them for binding 
activity against the target receptor molecule of 
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interest . One approach is to synthesize a huge number 
of oligopeptides by randomly reacting mixtures of amino 
acids. For example, twenty amino acids randomly 
combined into hexameric peptides will produce no less 
5 than 64 million compounds. The resulting mixture of 

compounds, or "library" of compounds, is then added to 
the target receptor molecule of interest and those 
compounds that bind to the target receptor molecule 
with the greatest affinity are selected as candidate 

10 compounds . 

Many -variations of this approach have been 
attempted. In addition to using mixtures of amino 
acids to generate oligopeptide libraries [Lam et al . , 
Nature , 354:82 (1991)), other examples of such 

15 variations include chemically derxvatizing oligopeptide 
libraries (Houghten, Current Biology , Vol. 4, No. G, 
p. 564 (1994)), and using libraries of oligonucleotides 
(Wang et al.. Biochemistry , 32:11285 (1993)) and 
organic medicinal molecules (see, disclosures by 

2 0 Pharmacopeia at the CBD Conference) . 

The most advanced chemical combinatorial library 
techniques include methods and strategies for the rapid 
screening, purification and identification of lead 
compounds . Examples of such techniques include 
25 synthesizing the compounds on polystyrene beads (Rapp, 
Technical Bulletin, Rapp Polymer GmbH, Eugenstrasse 
3 8/1, D72072, Tubingen, Germany and references cited 
therein) and tagging techniques (see, disclosures by 
Pharmacopeia at the CBD Conference) . Such techniques 

3 0 enable a lead compoixnd to be selected and identified 

from a library of numerous compounds relatively quickly 
(e.g., in a few weeks) . 

Despite these advances, there are drawbacks to 
both of these aforementioned techniques , most notably 
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in the time, and hence cost, it takes to finally design 
the optimum drug for a given purpose. For instance, in 
the structure-based-drug-design approach, the 
determination of a high resolution three-dimensional 
5 molecular structure of a target receptor molecule by 

either X-ray crystallography or NMR is time" consuming, 
frequently taking many years. The design and testing 
of a drug from such structures is also time consuming, 
typically requiring at least several iterations of the 

10 design-synthesis-testing cycle. 

Moreover, in the combinatorial library approach, 
although a group of compounds that bind to a given 
target receptor molecule can be selected very rapidly 
by this approach, the next step - establishing a 

15 so-called "structure- function" relationship - is 

fraught with difficulty. Ideally, to maximize the 
efficiency of the ultimate drug product, it is 
imperative to determine i) those parts of the selected 
compounds which are binding to the target receptor 

20 molecule; ii) those parts of the selected compounds 
which are structurally important for binding to the 
target receptor molecule (i.e., those parts important 
in providing distance, bracing, etc. to the binding 
functions of the compounds) ; and iii) those parts of 

25 the selected compounds which are unnecessaary for 

binding. Unfortunately, the most common and convenient 
chemical strategy of combinatorial library techniques 
is to produce oligomers such as oligopeptides . As 
discrete compounds, these oligomers are inherently 

30 "floppy" with no distinct molecular structure. 

Consequently, it is very difficult, if not impossible, 
to infer the conformation of these compounds in the 
bound state . 
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In practice, the structure-function relationship 
is much more easily deteinnined if the structure of the 
target receptor molecule, and especially that of the 
binding site, is already known. Under these 
5 circumstances, the structures of the selected compounds 
bound to the target receptor molecule possibly can be 
inferred. However, as noted above, the structure of 
the target receptor molecule may not be known or may 
take many years to determine. Therefore, if the 

10 structure of the target receptor molecule is not known, 
many of the advantages of the speed of the 
combinatorial library approach are negated. 

Several techniques using NMR have been developed 
for determining the conformation of a small molecule 

15 ligand bound to its receptor. These techniques require 
that either the ligand or its receptor or both be 
labeled with the NMR active stable isotopes ^^C and/or 
"N. For instance, recently, Schreiber and his co- 
workers ( Science , Vol. 266, p. 1241 (1994)) described a 

20 technique whereby the bound conformations of specific 
molecules could be determined using NMR. The 
conformations of the bound molecules were determined in 
two ways. In the first method, the molecules were 
bound to a protein target which had been enriched with 

25 the NMR active isotopes "C and by expression in E. 
coli cultured in an "C and ^^N medium. In this case, 
the "C- and "-^N- isotopes were used to transfer 
magnetization from the labeled protein to the unlabeled 
molecules bound to the labeled protein. The structure 

3 0 of the molecules in their bound conformation was 
thereby determined. 

In the second method, the molecules themselves 
were labeled by expression in E. coli . In this case, 
the -^C- and ^^N- isotopes were used to transfer 
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magnetization to the protons of the ligands and thereby 
determine the conformation of the molecules in their 
bound conformation. 

Earlier, Fesik and his co-workers (Nature, Vol. 
3S1, p. 83 {1993}) determined the conformation of 
cyclosporin A bound to its target molecule, cyclophilin 
[cites] . These workers not only studied cyclosporin 
bound to labeled cyclophilin, but also studied ^^C-- 
labeled cyclosporin bound to unlabeled cyclophilin. 
Both methods were successful in determining the bound 
conformation of cyclosporin A. 

A third technique for determining the conformation 
of a ligand bound to its target receptor involves 
labeling the ligand with "C and the receptor with ^^N or 
vice versa. All three of the techniques discussed 
above have the advantage over techniques used for the 
elucidation of the 3D structure of the receptor itself 
in that their utility is relatively unaffected bv the 
size of the ligand/receptor complex {for a detailed 
review of NMR techniques, see , Burger's Medicinal 
Chemistry and Ding Discovery, Vol. 1, Principles and 
Practice, Ed. Manfred E. Wolff, pages 314-315 and 
references cited therein) . 

All of the above methods are significant advances 
in the methodology of determining the conf oirmation of 
candidate drugs bound to their target receptor 
molecules. However, these methods have drawbacks with 
regard to the time-consuming determination of the 
structure -function relationship required by the 
combinatorial library approach. For example, Schreiber 
et al. labeled their target molecules in bacteria. In 
contrast;, many proteins, particularly many proteins of 
medical interest, cannot be expressed in bacteria and 
hence cannot be labeled in this way. Although a 
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procedure for producing proteins labeled with "C- and 
"N- isotopes in mamnialian and insect cells has recently- 
been described (see, U.S. Patent No. 5,324,558), the 
construction and selection of recombinant mammalian 
5 cell lines is time consuming, frequently taking a year 
or more . 

The use of isotopically- labeled compounds in the 
combinatorial library approach is desirable because of 
its potential for reducing the time needed to determine 

10 the conf ozmations of the bound molecules. The use of 
such labeled compounds would increase the speed with 
which the conformations of candidate drugs bound to 
their target receptor molecules could be determined. 
Moreover, the use of such compounds would enable the 

15 speedy determination of candidate drugs using the 

combinatorial libra2ry approach even if the structure of 
the target receptor molecule was not known. 

To date, however, the use of such com.pounds in the 
combinatorial library approach has been impractical, if 

20 not impossible, because no means of producing a 

combinatorial library of labeled compounds has been 
available. Thus, a need exists for methods of 
producing combinatorial libraries of labeled compounds 
which can be used in the combinatorial library approach 

25 to drug design. 

Summary of the Invention 

In accordance with the present invention, 
combinatorial libraries of labeled biochemical 
compounds and methods for producing such combinatorial 
30 libraries include (1) producing the individual units of 
the biochemical compounds in labeled form, (2) 
combining at least two of the labeled individual units 
so as to produce a biochemical compound in labeled 
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form, and (3) repeating step (2) at least once so as to 
produce a combinatorial library of labeled biochemical 
compounds . 

Also in accordance with the present invention, 
methods for determining the conformation of a 
biochemical compound include (1) producing a 
combinatorial library of labeled biochemical compounds ,- 
(2) contacting the combinatorial library of labeled 
biochemical compounds with a target receptor molecule 
so that a selected labeled compound binds to the target 
receptor molecule; and (3) determining the conformation 
of the selected labeled compound when bound to the 
receptor molecule . 

Description of the Preferred Embodiment =; 

According to the present invention, methods for 
producing combinatorial libraries of labeled 
biochemical compounds and combinatorial libraries of 
labeled biochemical compounds per se are provided. The 
methods of the present invention include (I) producing 
the individual units of the biochemical compounds in 
labeled form; (2) combining at least two of the labeled 
individual units so as to produce a biochemical 
compound in labeled foirm; and (3) repeating step (2) at 
least once so as to produce a combinatorial library of 
labeled biochemical compounds . 

Further according to the present invention, 
methods for determining the conformation of a 
biochemical compound are provided. The methods of the 
present invention include (I) producing a combinatorial 
library of labeled biochemical compounds; (2) 
contacting the combinatorial library of labeled 
biochemical compounds with a target receptor molecule 
so that a selected labeled biochemical compound binds 
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to the carget receptor molecule; and (3) determining 
the conformation of the selected labeled biochemical 
compound when bound to the target receptor molecule. 

The term "individual unit" as used herein refers 
5 to components and derivatives of components of micro- 
organisnas . Examples of components of microorganisms 
include, but are not limited to, amino acids, nucleic 
acids, fatty acids, and carbohydrates. 

The term "nucleic acids" as used herein refers to 
10 nucleic acids and derivatives thereof as well as to 
breakdown products of nucleic acids such as, for 
example, nucleotides and nucleosides. 

The term "combinatorial library" as used herein 
refers to a collection of biochemical compounds. A 
15 combinatorial library consists of at least two 
biochemical compounds . 

The term "selected labeled biochemical compound" 
as used herein refers to a labeled compound which binds 
to a target receptor molecule and, thus, is "selected" 

2 0 by virtue of the fact that it binds to the target 

receptor molecule. 

According to one preferred embodiment of the 
present invention, a combinatorial library of labeled 
biochemical compounds is produced. To make a 
25 combinatorial library of labeled biochemical compounds 
in accordance with the present invention, labeled 
individual units are first produced. As noted 
previously, the individual units are components, or 
derivatives of components, of microorganisms. The 

3 0 individual units become "labeled" as a result of 

growing the microorganisms in an environment enriched 
with stable isotopes. Preferably, the enriched 
environment contains stable NMR active isotopes such as 
^H, ^'C or ^^N, or any combinations thereof. 
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Any suitable means for growing microorganisms in 
an enriched environment can be used in accordance with 
the present invention. Many techniques have been 
published for the isotopic enrichment of micro- 
5 organisms, including growth of bacteria in the presence 
of labeled carbohydrate and salts (Kay, L. , et al . , 
Science , 249:411 (1990) and references cited therein), 
growth of bacteria in algal lysates (U.S. Patent No. 
5,324, S58; Chubb, R.T., et al . , Biochemistry . 30:7718 

10 (1991)), growth of yeast in algal lysates (Powers, R. , 

et al.. Biochemistry , 31:4334 (1992)), growth of 
bacteria and yeast in labeled methanol (Moat, A.G. and 
Foster, J. VI., Microbial Physiology , 2nd Ed., John 
Wiley & Sons, New York:, p. 218 (1988)), growth of 

15 mammalian and insect cells in labeled media (U.S. 

Patent No. 5,324,658) and the phototropic culture of 
algae in the presence of isotopically labeled ^^COj 
and/or ^^N salts (U.S. Patent Nos. 4,952,511, 5,104,803, 
5,151,347 and 5,162,051). All of the above cited 

2 0 references are herein incorporated by reference. 

In accordance with the present invention, any 
components of the above listed microorganisms can be 
used as the individual units for the biochemical 
compounds in the combinatorial libraries. Each 
25 microorganism contains many potential components which 
are produced in abundance by such organisms . 
Preferably, the individual units are amino acids, 
nucleic acids, fatty acids or carbohydrates, or 
derivatives thereof. Most preferably, the individual 

3 0 units are amino acids or nucleic acids. Examples 1 and 

2 , infra , are directed to the production of labeled 
amino acids and labeled nucleic acids , respectively . 
The components themselves are generally multi-valent . 
Accordingly, these components can be coupled one to 



CA 02225313 1997-12-19 

PCT/US96/10526 



another to form the mixtures of biochemical compounds 
found in the combinatorial libraries. 

Isolating these components fr-om the environment in 
which the. microorganisms are grown (i.e., biomass) can 
5 be performed through any suitable means. For example, 
amino acids can be isolated from biomass by the 
techniques described in U.S. Patent No. 5,324,658 and 
Moore, S. and Stein, W.T., J. Biol . Cham. , 192:663 
(1951) , nucleic acids can be isolated by the techniques 

10 described in Maniatis, T. , Fritsch, E.F., and Sambrook, 
J . , Molecular Clonincr, Cold Springs Harbor Laboratory 
(1982) , and fatty acids can be produced and isolated as 
described in U.S. Patent No. 5,376,540. All of the 
cibove cited references are herein incorporated by 

15 reference. 

Preparation of Amino Acid Individual Units 
According to a preferred aspect of the present 
invention, the individual units are isotopically 
labeled amino acids, preferably isolated from algal 
2 0 biomass. Many techniques exist for the culture of 

algal species in isotopically enriched environments. 
Preferably, a closed photobioreactor as described in 
U.S. Patent Nos. 4,952,511, 5,104,803, 5,151,347 or 
5,162,051, the disclosure of which are incorporated 

2 5 herein by reference, is employed- To maximize the size 

of the combinatorial library, it is important to obtain 
as many amino acids as possible from the biomass. For 
this reason, it is preferable to use a procedure that 
does not destroy any of the labile amino acids present . 

3 0 Accordingly, the biomass is preferably processed by the 

procedure described in PCT Application Publication 
Number WO9401329, a procedure which yields aspartic and 
glutamic acids, asparagine, glutamine, arginine and 
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cysteine as discrete amino acids and the remaining 
amino acids mixed together as a lyophilized powder. 

According to this procedure, the amino acids 
obtained in the lyophilized powder are then separated. 
Many methods for the separation of amino acids have 
been described on an analytical scale, and can be used 
in accordance with the present invention. Preferably, 
however, an adaption of the separation method of Moore, 
S. and Stein, W.T. , J. Biol. Chem. . 192:663 (1951) is 
employed. In the Moore and Stein method, the amino 
acids are separated using a cation exchange resin 
eluted with a citrate buffer containing low levels of 
reagents such as benzyl alcohol and thiodiglycol . The 
temperature of the column and eluant is raised to 60° C 
to effect the separation of isoieucine/methionine and 
phenyl alanine /tyro sine which otherwise elute together. 

Although the Moore and Stein separation method is 
a valuable method for the analysis- of mixtures of amino 
acids, it is not easily performed on a large scale due 
to its requirement for reagents such as thiodiglycol 
which are unpleasant to handle, and its requirement 
that the temperature of the column be raised above 
ambient temperature. 

According to a preferred aspect of the present 
invention, the amino acids obtained in a lyophilized 
powder are separated by chromatography on a cation 
exchange resin. In principle, any form of the resin 
may be used. Preferably, however, the resin is in the 
form of a simple metal ion such as sodium or potassium, 
with sodium being most preferred. The resin is 
preconditioned in a solution of the metal ion in a 
suitable salt form. In principle, any salt form may be 
used. Preferably, the salt is that of a weak 
carboxylic acid, such as acetate or citrate, thereby 
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buffering the pH of the solution. The concentration of 
the buffer is preferably in the range of about 0.05M to 
about 0.5M, with the concentration of 0 . IM being most 
preferred. The pH of the buffer is adjusted to between 
5 about 3 . 0 and about 3.5, with a pH of 3 . 2 being 

particularly preferred. No other reagents need to be 
added to the eluant which is maintained at ambient 
temperature throughout the process. 

The amino acids in the lyophilized powder are 

10 dissolved in the buffer solution. It will be 

appreciated by those skilled in the art that a higher 
degree of resolution of the eluting amino acids will be 
obtained if the amino acids are dissolved in a small 
volume of buffer. Accordingly, it is preferred that 

15 the amino acids are dissolved at a concentration of 

about Ig amino acids per about 5 mi to about 50 ml of 
buffer, with Ig per 10 ml being particularly preferred. 
At this concentration, all of the amino acids obtained 
in the lyophilized powder are soluble, except for 

2 0 perhaps a proportion of the tyrosine present. This 

proportion of tyrosine, if present, can be isolated as 
a purified material by any suitable means such as 
filtrauion or centrif ugation. 

The buffer solution containing the amino acids is 
25 applied to the column, and buffer solution not 

containing amino acids is pumped through the column. 
The amino acids threonine, serine, proline, alanine, 
glycine and valine are eluted from the column in 
substantially pure form. The pH of the elution buffer 

3 0 is changed to between about 4 and about 5, with a pH of 

4.2 being particularly preferred, and the buffer is 
pumped through the column. The amino acids isoleucine, 
methionine, leucine, phenylalanine and tyrosine are 
eluted from the column in substantially pure form. 
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The pH of the eluting buffer is changed to a pH in 
the range of about 8.0 to about 8.5, with a pH of 8.2 
being particularly preferred, and the buffer is pumped 
through the column. Histidine is eluted from the 
5 column in substantially pure form. The pH of the 
eluting buffer is increased to between about 9 and 
about 10, with a pH of 9.3 being particularly 
preferred, and the buffer is pumped through the column. 
The amino acids lysine and tryptophan are then eluted 

10 in substantially pure form. 

Each of the purified amino acids is isolated from 
the eluting buffer by any suitable means, such as by 
the use of ion exchange chromatography. Preferably, 
the following modification of the procedure described 

15 in PCT Application Publ ication Number WO9401329 is 

employed. The fractions containing a particular amino 
acid are combined and an acidifying agent is used to 
adjust the pH of this solution to between about 1 and 
about 3, with a pH of 2 being particularly preferred. 

20 The acidifying agent may be any strong acid. 

Preferably, the acidifying agent is hydrochloric acid. 
The resulting solution is applied to a cation exchange 
resin in the same ionic form as the eluting buffer. 
Preferably, the resin is in the sodium foirm. A large 

25 quantity of resin is preferably employed to ensure that 
no loss of bound amino acid occurs. Preferably, about 

2 to about 3 kg of resin are employed per approximately 

3 0 liters of eluant pumped through the column. 

The column is then washed with water until the pH 
3 0 of the eluant is between about 4 and about 5. 

Following the water wash, the amino acid adhering to 
the cation exchange resin may be removed by elution 
with a basic medium. In principle, any basic medium 
can be used. Preferably, the basic medium employed has 
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the same cation as the eluant buffer. A preferred 
basic medium is sodium hydroxide. The pH of the basic 
medium is such that the amino functions of the amino 
acid are deprotonated, while the carboxylate 
5 function (s) of the amino acid are negatively charged. 
The pH of the basic medium is thus preferably greater 
than a pH of about 10. However, to avoid racemization 
of the amino acid under conditions which are too 
strongly basic, the pH of the basic medium is less than 
10 about 13, preferably in the range of about 11 to about 
12. The basic medium neutralizes the acidic cation 
exchange resin with concomitant elution of the bound 
amino acid. 

The purification of the particular amino acid may 

15 cease at this point . It is preferable however to 

concentrate the amino acid further. Preferably, the 
eluate from the now neutralized acid ion exchange 
column is added to a column of anion exchange resin in 
a basic form. In principle, any basic form of the 

20 resin can be employed. Preferably, a simple basic form 
such as hydroxide is used. In order to concentrate the 
amino acid, the amount of anion exchange resin used is 
considerably smaller than the amount of cation exchange 
resin, from which the amino acids have been eluted, is 

25 used. The size of the column is preferably in the 

range of about lOOg to about 750g, with the range 200g 
to SOOg being particularly preferred. The amino acid 
is absorbed onto the basic ion exchange resin because 
while its amino groups now carry no positive charge, 

3 0 its carboxyl function (s) are now negatively charged. 

The basic medium is then removed from the OH- 
column by elution with water. It is important that no 
basic medium is left in contact with the amino acid 
bound to the resin. Accordingly, it is preferred that 
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approximately 2 to 8 volumes of water are used to wash 
the column. Most preferably, at least 4 bed volumes 
are used. 

Following removal of the basic medium by elution 
5 with water, the concentrated amino acid is removed from 
the basic anion exchange resin by elution with acid. 
In principle, any acid solution may be used. 
Preferably, a weak acid such as formic or acetic acid, 
or a dilute mineral acid such as hydrochloric acid may 

10 be used. The concentration of acid used is such that 
the pH of the acidic solution is in the range of 
between about 2 and about 6 , preferably in the range of 
3 to 5. At this point, the purified amino acids can be 
isolated by any suitable means such as by evaporation 

15 lander reduced pressure or lyophilization . The thus 
purified amino acid may be further purified by any 
suitable means such as by crystallization, if required. 

These procedures enable the preparation of amino 
acids labeled with any combination of stable isotopes, 

20 preferably the NMR active stable isotopes ^H, "c and/or 

Generation of Labeled Combinatorial Libraries 
According to the present invention, once the 
labeled individual units have been produced, at least 

25 two, and preferably at least three, labeled individual 
units are combined to form a labeled biochemical 
compound. For example, two labeled amino acids can be 
combined to form a labeled biochemical compound which 
is two amino acids in length, or two labeled nucleic 

30 acids can be combined to form a labeled biochemical 

compound which is two nucleic acids in length. Where 
the individual units are amino acids, each of the 
labeled amino acids is protected at the alpha-amino 
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position by a suitable protecting group before 
combining the amino acids . Such protecting groups are 
readily available. Examples of such protecting groups 
include, but are not limited to, t-boc, f-moc and the 
5 like. Additional suitable protecting groups to protect 
side chains on the amino acids can be used, if 
necessary. For example, S-benzyl groups can be used to 
protect the side-chains of cysteine. (For a review of 
protecting group chemistry, including that of amino 

10 acids, see Greene, T. and Watts, P.G.M., Protective 

Group Chemistry , 2nd Ed., John Wiley & Sons, New York, 
(19 91), incorporated herein by reference.) 

Each of the protected amino acids is activated at 
the alpha -carboxyl position by an activating reagent. 

15 Suitable activating reagents are readily available. 

Examples of such activating reagents include, but are 
not limited to, oxalyl chloride, diimidazolecarbonyl 
and the like. The activated protected labeled amino 
acids can be combined to form a labeled biochemical 

20 compound. The combination of the amino acids can be 
performed by any suitable means, such as those 
disclosed by Geisen (at the CBD Conference) , Houghten 
(Houghten, Current Biolocr/ , Vol. 4, No. 6, p. 564 
(1994)) or Lam (Lam et al . , Nature . Vol. 354, p. 82 

25 (1991) ) , all of which are incorporated herein by 

reference. 

Alternatively, where the individual units are 
nucleic acids, any suitable means can be used to 
generate a combinatorial library of labeled biochemical 
3 0 compounds. Preferred techniques include linking the 
individual units together by chemical synthesis and 
combining the individual units using enzyme catalyzed 
polymerization. In the chemical synthesis method, each 
of the nucleic acids is isolated as the free 
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nucleoside, preferably by the techniques described in 
Example 2 . Each of the nucleosides is then protected 
at the 5' position and, if necessary, each nucleoside 
is also protected at the exocyclic amino function. Any 
5 suitable protecting groups can be used. Examples of 

suitable 5 ' protecting groups include , but are not 
limited to, trityl, dimethoxytrityl , pixyl and the 
like. Examples of suitable exocyclic protecting groups 
are acetyl, isobutyryl , phenoxyacetyl and the like. 

10 The 3' position of the nucleoside is then 

phosphorylated . Phosphorylation of the 3' position can 
be accomplished by treating the derivatized (i.e., 
protected) nucleoside with a suitable phosphorylating 
agent. Preferably, the phosphorylating agent is a 

15 phosphoramidite reagent such as 2-cyanoethyl N,N- 
diisopropylchlorophosphoramidite or methyl N,N- 
diisopropylchlorophosphoramidite and the like . 

Each of the protected nucleic acids is then 
activated at the derivatized phosphite group using an 

20 activating agent . Suitable activating agents are 

available and known to those skilled in the art and 
include, but are not limited to, tetrazole, triazole 
and the like. The activated protected nucleic acids 
can be combined to foirm a labeled biochemical compound. 

25 The combination of the nucleic acids can be performed 
by any suitable means, such as the method of Wang et 
al . , Biochemistry , 32:11285 (1993), which is included 
herein by reference . 

Alternatively, in the enzyme catalyzed 

3 0 polymeration method, the nucleic acid individual units 
can be combined to form a labeled biochemical compound. 
In this method, the labeled individual units are 
isolated as the 5 ' -monophosphates , converted 
enzymatically to the 5' -triphosphate, and combined 
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using an enzyme catalyst and a suitable template. 
Examples of such combinations include DNA extension on 
a DNA template (PGR) , and. DNA extension on a RNA 
template (RT) . 

5 Accoirding to the present invention, at least two, 

and preferably at least three, individual units are 
combined to form a labeled biochemical compound which 
is at least two, and preferably at least three, 
individual units in size. It is to be understood that 

10 compounds of any size can be produced in accordance 

with. the present invention, the size of the biochemical 
compound depending upon the number of individual units 
combined to produce the biochemical compound. 

According to another aspect of the present 

15 invention, the process of producing a labeled 

biochemical compound is repeated at least one time so 
that a- library containing at least two labeled 
biochemical compounds exists . It is to be understood 
that the combinatorial libraries of labeled biochemical 

20 compounds of the present invention can contain any 

number of labeled compounds , the number of compounds 
contained in the combinatorial library depending upon 
the number of times the process of producing a labeled 
biochemical compound is repeated. 

25 For example, if the process is repeated 1000 

times, a combinatorial library containing 10 0 0 labeled 
biochemical compounds will be produced. Thus, the 
present invention provides for the production of 
combinatorial libraries of labeled biochemical 

30 compounds containing thousands, millions, or billions 
of compounds, the size of the libraries being 
determined by the number of times the process of 
producing a labeled biochemical compound is repeated. 
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Furthermore, it is to be understood that the 
production of labeled biochemical compounds does not 
have to occur sequentially (i.e., one at a time) 
Rather, it is preferred that labeled biochemical 
5 compounds be produced simultaneously so that less time 

is required to produce a desired number of labeled 
biochemical compounds. Simultaneous production of 
labeled biochemical compounds occurs, for example, when 
a mixture of all amino acids (20) is combined with a 

10 single amino acid, thereby creating 2 0 different 
compoxmds two amino acids in length. 

The production of combinatorial libraries of 
labeled biochemical compounds containing such large 
numbers of compounds enables the speedy determination 

15 of the conformation of bound candidate drugs using the 
combinatorial library approach, regardless of whether 
or not the structure of the target receptor molecule 
with which the combinatorial library is being contacted 
is known. 

20 Selection of Labeled 

Compounds by Binding- to Tarcret Receptor 
The methods described herein produce combinatorial 
libraries of labeled biochemical compounds . These 
combinatorial libraries of labeled biochemical 

25 compounds are totally compatible with methods used to 
screen and select molecules which bind to a selected 
target receptor molecule. Therefore, these 
combinatorial libraries can be used in accordance with 
combinatorial library approaches known in the art. In 

30 contrast to past approaches, however, the libraries of 
the present invention enable the conformation of the 
biochemical compoxmd bound to the target receptor 
molecule to be examined by NMR by virtue of the active 
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Stable isotopes incorporated into the biochemical 
compound, regardless of whether or not the conformat: 
of the target receptor molecule is known. 



Determination of Conformation 
of Bound Labeled Compound 

According to a preferred embodiment of the present 
invention, the conformation of a selected biochemical 
compound is determined. To determine the conformation 
of a selected biochemical compound, a combinatorial 
library of labeled biochemical compounds, where the 
compounds are labeled with at least one stable isotope, 
must first be produced. It is preferred that the 
biochemical compounds are labeled with stable NMR 
active isotope (s) . Furthermore, it is preferred that 
the combinatorial library is produced in accordance 
with the procedures detailed previously in this 
application. 

Once the combinatorial library of labeled 
biochemical compounds is produced, the combinatorial 
libra2ry is contacted with a target receptor molecule so 
that a selected labeled biochemical compound can bind 
to the target receptor molecule. 

Once the selected labeled biochemical compound is 
bound to the target receptor molecule, the conformation 
of the selected labeled biochemical compound bound to 
its target receptor can be determined due to the 
presence of the stable isotope label. For example, if 
the selected biochemical compound is labeled with 
stable NMR active isotopes, the conformation of the 
selected biochemical compound can be determined through 
NMR analysis (for a detailed review of NMR techniques, 
see. Burger's Medicinal Chemistry and Drug Discovery, 
Vol. 1, Principles and Practice, Ed. Manfred E. Wolff, 
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pages 314-315 and references cited therein) . 
Obviously, the method of determination of the 
conformation of the selected labeled biochemical 
compound will depend upon the type of label employed. 
5 One important aspect of the pr-esent invention is 

that the conformation of a selected labeled biochemical 
compound can be determined without knowing the 
conformation of the target receptor molecule to which 
the selected biochemical compound is bound. 

10 Accordingly, the present invention can be used to 
determine the conformation of a selected labeled 
biochemical compound regardless of whether or not the 
conformation of the target receptor molecule is known. 
While the invention has been described and 

15 illustrated with details and references to certain 

preferred embodiments, those skilled in the art will 
appreciate that various modifications, changes, 
omissions and substitution can be made without 
departing from the spirit of the invention. 

20 Example 1 

Preparation of Isotopically Labeled Amino Acids 
5G%2H, 13C, 15N-labeled mixed amino acids 
(prepared according to the procedures described in PCT 
Application Publication Number WO9401329 EXAMPLE 9, 

25 22g) were added to sodium citrate (O.IM, pH 3.3, 
containing O.OOIM sodium azide, 250 ml) and the 
suspension stirred at room temperature for 1 hour. 
Undissolved material (principally tyrosine) was removed 
by centrifugation and the supernatant added to a column 

30 (15x200cm) of Dowex 50WX4, 200-400 mesh preconditioned 

in sodium citrate (O.IM, pH 3.3, containing O.OOIM 
sodium azide) . The column was eluted with sodium 
citrate (O.IM, pH 3.3, containing O.OOIM sodium azide) 
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at room temperature at a flow rate of 25 ral/min. 2 
litre fractions being collected. Threonine was eluted 
in fractions 33-38, serine in fractions 41-50, proline 
in fractions 53-60, alanine in fractions 71-74, glycine 
5 in fractions 75-87, and valine in fractions 92-99. 

The eluant was changed at fraction 102 to sodium 
citrate (O.IM, pH 4.2, containing 0 . OOIM sodium azide) 
and elution continued at room temperature at the same 
flow rate as before. Isoleucine was eluted in 

10 fractions 123-124, methionine was eluted in fractions 
126-127, leucine was eluted in fractions 129-135, 
phenylalanine was eluted in fractions 170-177, and 
tyrosine was eluted in fractions 183-189. 

The eluant was changed at fraction 201 to sodium 

15 citrate (O.IM, pH 6.3, containing 0 , OOIM sodium azide) 

and at fraction 222 to sodium citrate (O.IM, pH 8.2, 
containing O.OOIM sodium azide). In both cases elution 
continued at room temperature at the same flow rate as 
before. Histidine was eluted in fractions 238-241. 

20 The eluant was changed at fraction 2 61 to sodium 

citrate (O.IM, pH 9.3, containing O.OOIM sodium azide) 
and elution continued at room temperature at the same 
flow rate as before. Tryptophan was eluted in 
fractions 274-278 and lysine was eluted in fractions 

25 279-281. 

Each of the amino acids was isolated by the 
following procedure. The Leucine -containing fractions 
were combined and the pH adjusted with concentrated 
hydrochloric acid to pH 2 . The resulting solution was 

3 0 pumped at 2 00 ral/min. onto Dowex 5 0WX4, 200-400 mesh 

ion exchange resin (sodium form, 2 . SKg) . The resin was 
then washed from water at 200 ml/min. until the pH of 
the eluate was approximately pH 5 . The pump was 
stopped and the column connected at its base to a 
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column of Dowex 50X8-100 ion exchange resin (OH" form, 
500g) . Sodium hydroxide (pH 12) was pumped at 100 
ml/min. through the columns until the pH of the eluant 
from the OH" column was approximately 12 . Pumping 
5 continued for a further 5 litres . The pump was then 

stopped, connected directly to the OH" column and water 
pumped at 100 ml/rain, through the OH" column until the 
pH of the resulting eluant was approximately pH 8 . 

Aqueous acetic acid was then pumped through the OH" 
10 . column at 100 ml/min. and fractions (100 ml) collected. 
Ninhydrin positive fractions were pooled, and 
evaporated under reduced pressure. The resulting solid 
was crystallized from aqueous ethanol to yield 50% 2H, 
13C, 15N-leucine, l.lg. 

15 g^^ampl^ 2 

Production of Uniformly Stable Isoto pically 
Labeled Nucleosides and Nucleotides from Labeled 
Prototrophicallv Grown Biomass 

Biomass is obtained from a prototroph {e.g., 

20 Chlorella. -vulgaris) by growing the organism on very 

simple stable isotopically labeled carbon and nitrogen 

sources (e.g., for Chlorella. vulgaris "COj and ^^NOj or 

"NHj) . The cells are harvested by centrif ugation in 

batch or using a continuous centrifuge. Cells are 

25 disrupted using any of a number of techniques to 

release the nucleic acids (e.g., cavitation). Total 

nucleic acids are extracted following a fairly standard 

procedure (taking care to maintain an RNase free system 

at all times) . In one method, the broken biomass can 

30 be extracted with an equal volume of 24:24:1 Tris (pH 

8.0) saturated phenol : chloroform: isoamyl alcohol and 

vigorously shaken. This is separated into phases by 

centrif ugation. The aqueous phase is extracted with 

24:1 chloroform: isoamyl alcohol, shaken vigorously and 
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separated into phases by centrifugation. The aqueous 
phase is supplemented with ammonium acetate and the 
nucleic acids salted out with an equal volume of 
isopropanol at cold temperature. The resulting 
5 precipitated nucleic acids are washed with ethanol 
containing ammonium acetate to remove excess salts . 
The final pellets are dried to remove the ammonium 
acetate. The dried pellets are dissolved in a Tris (pH 
8.0) plus magnesium chloride and EDTA buffer at a 

10 minimum volume. 

The nucleic acids are quantitated 
spectrophotometrically and treated with DNase I . This 
digestion is carried out at 31 °C for 6-8 hours in the 
presence of pancreatic RNase inhibitor. A second spike 

15 of DNase I is added after the initial incubation to 

assure total digestion of the DNA into small fragments 
{tetramers or smaller) . After a total of 24 hours 
incubation the reaction mixture is placed in a 
ultrafiltration stirred cell fitted with a 3,000 MWCO 

20 membrane and allowed to go to a minimum volume while 

collecting permeate. The retentate is then washed with 
Tris buffer two times to remove residual mono- and 
oligo-deoxynucleotides . All of the permeates are 
freeze dried. At this point the material is suspended 

25 in ammonium acetate buffer and treated with nuclease PI 
to breakdown the small oligomers to monomer 
deoxyribonucleotide 5' monophosphates (NMPs) . The dNMP 
digested mixture is put through a 3,000 MWCO filter to 
remove enzyme and the retentate is washed once with 

3 0 water. The permeates are tested by HPLC, combined, 
then freeze dried to remove ammonium acetate . These 
may need desalting at this point depending on their 
eventual usage . 
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The stirred cell retentate from the DNase I 
digestion is now treated with nuclease Pi to breakdown 
residual material (almost entirely RNA) into monomers. 
This incubation is done in ammonium acetate buffer and 
5 the entire stirred cell is moved to a 37°C incubator 
for overnight digestion. Once complete digestion is 
verified by HPLC, the material is filtered through a 
3,000 MWCO filter until a minimal volume is retained. 
The retentate is washed twice with distilled water. 

10 All the permeates containing significant amounts of 
ribonucleotide 5' monophosphates are pooled and 
dialyzed versus distilled water to remove excess salt . 
This is then freeze dried to provide ribonucleotide 5 ' 
monophosphates (NMPs) . 

15 Depending on the starting biomass additional 

labeled biochemicals may be obtained from these 
preparations. For example, with Chlorella vulgaris the 
cells are broken and centrifuged and a starch pellet is 
recovered that can be used either as labeled starch or 

2 0 for further processing to glucose. 
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What is claimed • 

1. A method of producing a combinatoirial libra2ry 
of labeled biochemical compounds comprising the steps 
of: 

(a) producing the individual units of the 
biochemical compounds in labeled form; 

(b) combining at least two of the labeled 
individual units so as to produce a biochemical 
compound in labeled form capable of having its 
conformation determined; 

(c) repeating step (b) at least once so as 
to produce a combinatorial library of labeled 
biochemical compounds . 

2. The method of claim 1, wherein said 
individual units are selected from the group consisting 
of amino acids, nucleic acids, fatty acids, and 
carbohydrates, or derivatives thereof. 

3. The method of claim 2, wherein said 
individual units are amino acids. 

4. The method of claim 2, wherein said 
individual units are nucleic acids . 

5. The method of claim 3, wherein the labeled 
amino acids are produced by a method comprising the 
steps of: 

(a) producing labeled amino acids in an 
algal biomass; and 

(b) recovering said labeled amino acids from 
the algal biomass. 
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6. The method of claim 1, wherein the 
combination of individual units so as to produce more 
than one labeled biochemical compound occurs 
simultaneously. 

7. A method of producing a combinatorial library 
of labeled biochemical compounds comprising the steos 
of : 

(a) producing labeled amino acids; 

(b) combining at least two of the labeled 
amino acids so as to produce a labeled biochemical 
compound capable of having its conformation determined; 

(c) repeating step (b) at least once so as 
to produce a combinatorial library of labeled 
biochemical compounds. 

8. A method of producing a combinatorial library 
of labeled biochemical compounds comprising the steps 
of : 

(a) producing labeled nucleic acids; 

(b) combining at least two of the labeled 
nucleic acids so as to produce a labeled biochemical 
compound capable of having its conformation determined; 

(c) repeating step (b) at least once so as 
to produce a combinatorial library of labeled 
biochemical compounds . 

9. A method of searching for a candidate drug 
using a combinatorial library approach, the improvement 
comprising using the combinatorial library of labeled 
biochemical compounds, produced in claim 1. 
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10. A combinatorial library of labeled 
biochemical compounds, each compound capable of having 
xts conformation determined, wherein said library 
contains at least 1,000 labeled biochemical compounds. 

11- The combinatorial library of claim io 
Wherein said library contains at least 10 , 000 'labeled 
biochemical compounds. ^eiea 

12. The combinatorial library of claim lo 
wherein said library contains at least 100,000 labeled 
ioiochemical compounds . 

13. The combinatorial library of claim 10, 
wherein said library contains at least l,ooo,ooo 
labeled biochemical compounds. 

14. The combinatorial library of claim 10 
Wherein said library contains at least lo,ooo 000 
labeled biochemical compounds. 

15. The combinatorial library of claim lo, 
Wherein the library is produced by the process of claim 



The combinatorial library of claim lo 
wherein the labeled biochemical compounds comprise 



ammo acids . 



17. The combinatorial library of claim 10, 
wherexn the labeled biochemical compounds comprise 
nucleic acids. 
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ls. The combinatorial library of claim 10, 
wherein the labeled biochemical compounds are labelled 
with NMR active isotopes. 

19. A method of determining the conformation of a 
selected biochemical compound which comprises-. 

(a) producing a combinatorial library of 
labeled biochemical compounds ,- 

(b) contacting the combinatorial library of 
labeled biochemical compounds with a target receptor 
molecule so that a selected labeled biochemical 
compound binds to the target receptor m.olecule; and 

(c) determining the conformation of the 
selected labeled biochemical compound when bound to the 
receptor molecule. 

20. The method of claim 23, wherein the stable 
isotopes are NMR active isotopes. 

21. The method of claim 20, wherein the 
conformation of the labeled selected biochemical 
compound is determined through NMR analysis. 

22. The method of claim 19, wherein the 
conformation of the receptor molecule is unknown. 

23. The method of claim 19, wherein the 
biochemical compounds are labeled with stable isotopes. 

24 The method of claim 19, wherein the 
combinatorial library of labeled biochemical compounds 
is produced by a method comprising the steps of: 

(a) producing the individual units of the 
biochemical compounds in labeled form; 
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(b) combining at least two of the labeled 
individual units so as to produce a biochemical 
compound in labeled form; 

(c) repeating step (b) at least once so as 
to produce a combinatorial library of labeled 
biochemical compounds . 

25. The method of claim 24, wherein said 
individual units are selected from the group consisting 
of amino acids, nucleic acids, fatty acids, and 

carbohydrates, or derivatives thereof. 

26. The method of claim 25, wherein said 
individual units are amino acids . 

27. The method of claim 26, wherein the labeled 
amino acids are produced by a method comprising the 
steps of: 

(a) producing labeled amino acids in an 
algal biomass; and 

(b) recovering said labeled amino acids from 
the algal biomass.- 

28. The method of claim 24, wherein the 
combination of individual units so as to produce more 
than one labeled biochemical compound occurs 
simultaneously . 

29. The method of claim 19, wherein the 
combinatorial librairy of labeled biochemical compounds 
is produced by a method comprising the steps of: 

(a) producing labeled amino acids; 

(b) combining at least two of the labeled 
amino acids so as to produce a labeled biochemical 
compound; 
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(c) repeating step (b) at least once so as 
to produce a combinatorial library of labeled 

biochemical compounds . 



